Automating Multi-Level Annotations of Orthographic Properties of German Words and Children’s Spelling Errors
Abstract
This paper presents the automatic annotation of orthographic properties of German words and of spelling errors in texts written by German primary school children, following a new multi-layered annotation scheme [1]. The scheme is closely linked to the principles of the German writing system and is intended to support new research questions about the relationship between the spelling errors of competent and less competent spellers and the regularities of the German graphematic system. A novelty of the automatic annotation is that it takes an intended, correctly spelled word as input and applies a set of rules to generate a list of error candidates containing systematic spelling errors. As a further novelty, the annotation of additional word- and error-related properties is presented, such as whether the spelling error changes the word’s pronunciation and whether a spelling can be derived from a related word form. This not only enables more detailed analyses of the errors but also allows us to develop an application for learners that generates automatic advice for the correct spelling. A first evaluation shows that the automatic annotation of the presented categories and features can come close to human annotations.
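The candidate-generation idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' rule set or implementation: the four rules, the category labels, and the names `RULES`, `generate_error_candidates`, and `annotate` are all hypothetical, chosen only to show how a correctly spelled target word might be expanded into systematic error candidates and then matched against a child's actual spelling.

```python
import re
from typing import Callable, Dict, Iterator, List, NamedTuple


class Candidate(NamedTuple):
    spelling: str   # misspelling produced by a rule
    category: str   # hypothetical label for the rule that produced it


def _drop_doubled_consonant(word: str) -> Iterator[str]:
    # "kommen" -> "komen": one candidate per doubled consonant
    for m in re.finditer(r"([bdfgklmnprst])\1", word):
        yield word[:m.start()] + m.group(1) + word[m.end():]


def _drop_lengthening_h(word: str) -> Iterator[str]:
    # "fahren" -> "faren": remove an h between a vowel and l/m/n/r
    for m in re.finditer(r"(?<=[aeiouäöü])h(?=[lmnr])", word):
        yield word[:m.start()] + word[m.end():]


def _simplify_ie(word: str) -> Iterator[str]:
    # "Liebe" -> "Libe": write long i as plain i
    for m in re.finditer(r"ie", word):
        yield word[:m.start()] + "i" + word[m.end():]


def _final_devoicing(word: str) -> Iterator[str]:
    # "Hund" -> "Hunt": spell the word-final consonant as it is pronounced
    pairs = {"b": "p", "d": "t", "g": "k"}
    if word and word[-1] in pairs:
        yield word[:-1] + pairs[word[-1]]


# Illustrative rule table (an assumption, not the scheme from the paper).
RULES: Dict[str, Callable[[str], Iterator[str]]] = {
    "doubled_consonant_omitted": _drop_doubled_consonant,
    "lengthening_h_omitted": _drop_lengthening_h,
    "ie_written_as_i": _simplify_ie,
    "final_devoicing": _final_devoicing,
}


def generate_error_candidates(target: str) -> List[Candidate]:
    """Apply every rule to the intended word and collect the misspellings."""
    candidates = []
    for category, rule in RULES.items():
        for spelling in rule(target):
            if spelling != target:
                candidates.append(Candidate(spelling, category))
    return candidates


def annotate(target: str, observed: str) -> List[str]:
    """Return the categories whose candidates match the observed spelling."""
    return [c.category for c in generate_error_candidates(target)
            if c.spelling == observed]
```

With these toy rules, `annotate("kommen", "komen")` returns the single label for an omitted doubled consonant, while a correct spelling yields an empty list. Starting from the intended word rather than from the misspelling keeps each candidate tied to the rule that produced it, which is what makes the resulting annotation interpretable.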
Related resources
Annotating Spelling Errors in German Texts Produced by Primary School Children
We present a new multi-layered annotation scheme for orthographic errors in freely written German texts produced by primary school children. The scheme is closely linked to the German graphematic system and defines categories for both general structural word properties and error-related properties. Furthermore, it features multiple layers of information which can be used to evaluate an error. Th...
Design and implementation of Persian spelling detection and correction system based on Semantic
Persian Language has a special feature (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, design and implementation of NLP tools for Persian are more challenging than other languages (e.g. English or German). Spelling tools are used widely for editing user texts like emails and text in editors. Also developing Persian tools will provide Persian progr...
Children’s written and oral spelling
For adults, written spelling is generally superior to oral spelling. To determine whether the same holds true for children in kindergarten through second grade, we compared children’s ability to spell real words (Experiment 1) and nonsense words (Experiment 2) orally and in writing. Building on the work of Tangel and Blachman (1992, 1995) and others, we developed a reliable system to assess the...
Learning to spell in Hebrew: Phonological and morphological factors
This paper investigates children’s developing knowledge of the Hebrew spelling system in view of the claim that language-specific typology affects the rate and the pattern of development of orthographic spelling. Hebrew is a morphologically synthetic language with a phonologically “deep” orthography, on the one hand, and a cons...
Children's Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement
Within the scope of the SPACE project, the CHildren’s Oral REading Corpus (CHOREC) is developed. This database contains recorded, transcribed and annotated read speech (42 GB or 130 hours) of 400 Dutch speaking elementary school children with or without reading difficulties. Analyses of inter- and intra-annotator agreement are carried out in order to investigate the consistency with which reading...
Publication date: 2016